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Abstract —We consider a setting in which a sender wishes 
to broadcast a block of K data packets to a set of wireless 
receivers, where each of the receivers has a subset of the data 
packets already available to it (e.g., from prior transmissions) 
and wants the rest of the packets. Our goal is to find a linear 
network coding scheme that yields the minimum average packet 
decoding delay (APDD), i.e., the average time it takes for a 
receiver to decode a data packet. Our contributions can be 
summarized as follows. First, we prove that this problem is NP- 
hard by presenting a reduction from the hypergraph coloring 
problem. Next, we show that a random linear network coding 
(RLNC) provides an approximate solution to this problem with 
approximation ratio 2 with high probability. Next, we present 
a methodology for designing specialized approximation algo¬ 
rithms for this problem that outperform RLNC solutions while 
maintaining the same throughput. In a special case of practical 
interest with a small number of wanted packets our solution 
can achieve an approximation ratio 4 ~|' A . Finally, we conduct 
an experimental study that demonstrates the advantages of the 
presented methodology. 

Index Terms —Network coding, decoding delay, NP-hardness, 
approximation algorithm. 

I. Introduction 

In this paper, we are interested in a wireless broadcast 
scenario, in which a sender wishes to broadcast a block of 
K data packets to a set of wireless receivers, such that each 
of the receivers already has a subset of the data packets 
available to it (e.g., from prior transmissions) and is interested 
in obtaining the rest of the packets. Given a packet reception 
instance, the goal is to design a linear network coding (NC) 
scheme that minimizes the average packet decoding delay 
(APDD), which is defined as the average time it takes for a 
receiver to decode a data packet. 

One of the possible solutions to this problem is to employ a 
random linear network coding (RLNC) technique [1], [2]. In 
wireless broadcast scenarios, RLNC can achieve an optimal 
throughput (i.e., minimize the time required to decode all 
packets by all receivers) with high probability by mixing 
all data packets in the block together using linear coeffi¬ 
cients randomly chosen from a sufficiently large finite field. 
However, RLNC is suboptimal in terms of APDD, since in 
general, no data packet can be decoded by a receiver until it 
receives K linearly independent coded packets. 

Many opportunistic NC techniques have been developed 
with the aim to reduce APDD or some other measures of 
decoding delay [2]—[17]. An important technique in this 
class is instantly decodable network coding (IDNC). The 
IDNC technique has a potential to reduce the APDD by 
enabling a subset of receivers to instantly decode a data 
packet after each transmission. IDNC has been shown to 
outperform RLNC in terms of APDD for a small number 
of receivers [15]. However, since in IDNC schemes a single 
transmission typically benefits only some of the receivers, 


IDNC is not throughput optimal. As a result, a larger number 
of transmissions is necessary to finish the broadcast, which 
increases the decoding delay for some receivers, and, as 
a result, increases the value of APDD. Indeed, for larger 
number of receivers, the throughput of IDNC decreases and 
APDD increases due to lack of coding opportunities. A 
similar behavior can be observed for other opportunistic 
coding techniques [5], 

In summary, there is no clear winner between RLNC, 
IDNC, and other opportunistic techniques, as each of them 
prevail in a different parameter region. Moreover, while 
the APDD of RLNC can be easily calculated (as shown 
in Section IV), the achievable APDD of opportunistic NC 
techniques has not been characterized analytically. 

The contributions of this paper is summarized as follows: 

• We first prove that it is NP-hard to minimize APDD, 
by presenting a reduction from the hypergraph coloring 
problem. 

• Next we show that RLNC achieves an approximation 
ratio of 2 with high probability, i.e., the APDD achieved 
by RLNC is at most two times the optimal solution. 

• We present a methodology for designing specialized 
approximation algorithms that achieve lower values of 
APDD than RLNC while maintaining the same optimal 
throughput. We also present a case study to demonstrate 
the algorithm design. We conduct extensive simulations 
to confirm that our methodology outperforms alternative 
solutions in the broad range of practical settings. 

II. System Model 

Our model includes a single sender that holds a set of 
K data packets that belong to F g , V = {pa-}^ 15 and a 
set of N receivers, {r„}^ =1 , each wants a subset W n of 
V and has the rest. The packet reception instance in our 
model is represented by a binary N x K state feedback matrix 
(SFM) A , where A(n , At) = 1 means that r„ wants p^, and 
A(n, k) = 0 means that receiver r n has packet p already 
available to it (e.g., from prior transmissions). We denote by 
w n the size of W n , and by tk the number of receivers who 
want pk . An example of SFM is given in Fig. 1(b), which 
has W\ = 3 and t\ = 2. 

Given A, the sender performs a linear NC transmission 
phase. In each NC transmission, the sender encodes data 
packets in A4 together using linear coefficients from a finite 
field F 9 . The corresponding packet A' takes the form of: 

x= (1) 

Pk&M 

We denote by A4 = {pk G V\Pk 7^ 0} the support of X 
and refer to it as an coding set of X. When the coefficients 
{Pk} are chosen from F ? uniformly at random, X is called 
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(b) State feedback matrix A 


Fig. 1. A hypergraph and its equivalent SFM A 


a random-coded packet of A4. A receiver will increase its 
degree of freedom (DoF) by one when it receives a NC packet 
that is linearly independent of the set of all the packets it 
already has. The broadcast will be completed at a receiver 
once it decodes all its wanted data packets. 

In order to study the global minimum decoding delay of 
linear NC in wireless broadcast, we assume the following: 

1) NC transmissions are erasure-free, so that every trans¬ 
mitted NC packet can be received by all receivers; 

2) Receivers have sufficient computational resources to 
perform NC decoding under any F g . When random 
coding is applied, a sufficiently large F 9 will ensure the 
linear independency among the random-coded packets 
with high probability. 

A set of U coded packets is called a NC solution and 
is denoted by S if it allows every receiver to decode all its 
wanted packets. Let u n> k be the index of the NC transmission 
at which r n decodes p/, . The average packet decoding delay 
(APDD) of <S, denoted by Dg, is calculated as: 

1 N K 

Ds = —Jj ( 2 ) 

Zjn=1 Wn n=1k =1 

Our aim in this paper is to study the smallest Dg over all 
possible linear NC solutions. We call it the minimum APDD 
of A and denote it by D mm . The first question we would 
like to answer is: Is it hard to find D m i n ? 

III. The Hardness of Finding D min 

In this section, we study the hardness of finding -D m ; n . To 
this end, we will first introduce the concept of perfect NC 
solution , whose APDD is a lower bound of D mm . Then, we 
will prove that deciding whether a perfect solution exists for 
a given instance A of the problem at hand is an NP-hard 
problem. This implies, in turn, that the problem of finding 
D m i n is also NP-hard. 


A. The Perfect Solution and a Bound of D m \ n 
Definition 1. A NC solution S is called a perfect solution 
and is denoted by S p if it allows every receiver n to decode 
a wanted data packet in every transmission of S. 


Obviously, S p offers the ideal packet decoding scenario. 
Its APDD is thus a lower bound of -D m j n , denoted by D_, 
which is calculated as: 
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It is clear that D can only be achieved by S p if a perfect 
solution exists. The natural question =in this context is: 
Does a perfect solution S p exist for every SFM? In the next 
subsection, by using a reduction from the strong hypergraph 
coloring problem, we will prove that this question is NP-hard 
to answer. 


B. Hardness of Finding S p 

We first introduce some useful concepts in hypergraphs. A 
hypergraph TL is defined by a pair (V, £), where V is the set 
of vertices, and £ is the set of hyperedges. Every hyperedge 
e £ £ is a subset of V with size |e| ^ 1. A hypergraph is 
r—uniform if every hyperedge e has equal size, i.e., |e| = r. 
A k-strong coloring solution of TL is a partition of V into k 
subsets {Vj}j =1 , such that |Vjfle| ^ 1 for any e £ £. In other 
words, every color appears at most once in every hyperedge. 
It is well known that the hypergraph coloring problem is 
intractable. 

Lemma 1 ([18]). It is NP-hard to determine whether an 
r— uniform hypergraph is r-strong colorable, for any r ^ 3. 

We then build a reduction from the strong hypergraph 
coloring solution for r-uniform hypergraphs to the problem 
of finding a perfect NC solution for the average delay min¬ 
imization problem. Given an r-uniform hypergraph TL(V,£) 
we construct an instance to our problem as follows. First, 
for each vertex Vk we introduce a data packet p;-, and for 
each hyperedge e n we introduce a receiver r„ who wants 
the data packets that correspond to vertices in e n . Note that 
in the resulting SFM A, every receiver wants r data packets. 
A 3-uniform hypergraph and the corresponding SFM matrix 
are depicted in Fig. 1. 

First, we prove that an existence of an r-strong coloring 
solution {V, }' =1 of TL implies a perfect solution for our 
problem. Let {V,;}’_ 1 be an r-strong coloring of TL. For each 
Vi, let Aij be a set of packets that correspond to vertices in 
Vi. Note that for each each receiver r n and each set Adi it 
holds that \M. i C\W n \ = 1. Consider a coding solution S that 
includes r transmissions, such that transmission i includes a 
sum of packets in Adi (over F g ). Since every receiver can 
decode a packet at each transmission, S is a perfect solution 
to our our problem. 

Next, we show that a perfect solution S p for the instance 
of A of our problem implies that there exists an r-strong 
coloring solution {V,;}[ =1 of TL. Let AA, be the coding set 
that corresponds to the transmission i of S p and let V, be the 
set of vertices in H that correspond to A it. Note that in order 
to allow every receiver to decode one data packet in each of 
the r transmissions, every Ad, must contain one wanted data 
packet of every receiver, i.e., \AdiC\Wn\ = 1. Thus, {Vi}’L 1 
is an r-strong coloring solution {V l } r i=i of TL. 

We conclude that an r-uniform hypergraph is r-strong 
colorable if and only if there exists a perfect NC solution 
of the instance A of our problem. We summarize our results 
in the following lemma: 

Lemma 2. It is NP-hard to determine whether there exists a 
perfect solution for a given instance A of minimum APDD 
problem. 

Proof: The theorem follows from our construction and 
Lemma 1. ■ 












C. The Hardness of Finding -D m ; n 

Since D can only be achieved by a perfect solution S p , an 
optimal algorithm that finds D m j n will be able to determine 
the existence of a perfect solution by comparing D m with 

D. According to Lemma 2, this decision is NP-hard to made, 
and thus it is NP-hard to find 

Theorem 1. It is NP-hard to find D mm for a given instance 
A of minimum APDD problem . 

In addition to NP-hardness, our reduction from the hyper¬ 
graph coloring also yields an interesting conjecture on the 
existence of perfect solution for some special instances of 
A. It comes from the famous Erdos-Faber-Lovasz conjecture 
in graph theory [19]: 

Conjecture 1 (Erdos-Faber-Lovasz [19]). Consider an 
r-uniform hypergraph with r hyperedges. Each pair of hyper¬ 
edges have at most one vertex in common. This hypergraph 
is r-strong colorable. 

The corresponding conjecture in NC context is as follows: 

Conjecture 2. Consider an instance A of our problem with 
r receivers, each wants r data packets, and each pair of 
receivers want at most one data packet in common. This A 
has a perfect NC solution S p . 

We showed that the problem of finding a minimum value 
of -D m i n is intractable. Accordingly, in the next sections we 
discuss approximation algorithms for this problem. 

IV. Approximating D min 

In this section, we aim at approximating D ]n \ u - An approx¬ 
imation algorithm of D m j„ produces a linear NC solution S 
with its APDD obeying D$ f cnD m i n . We refer to a ^ 1 as 
approximation ratio of the algorithm. 

In the next theorem we analyze the approximation ratio of 
the RLNC technique: 

Theorem 2. RLNC technique is at most a 2—approximation 
algorithm of D mln . 

Proof: In every RLNC transmission, the sender sends 
a random-coded packet of all data packets. With high prob¬ 
ability (that asymptotically goes to 1 with the field size), 
after receiving w n such packets, receiver r„ can decode 
all its wanted data packets by performing block decoding, 
i.e., solving a set of w„ linear equations. Hence, the APDD 
offered by RLNC is: 

1 N 

■Drlnc = w ™' ( 6 ) 

En =1 Wn rt=l 

Comparing Drlnc with the lower bound D in (5), we have: 
7?RLNC 

D 


Since /9 rnin ^ £>, Drlnc at most doubles l) mm - Thus, 
RLNC is at most a 2-approximation algorithm of D rnin . ■ 
Therefore, RLNC technique offers guaranteed APDD per¬ 
formance. On the other hand, to the best of our knowledge ex¬ 
isting opportunistic APDD-reduction techniques are not able 
to provide provable performance guarantees. For example, let 


us analyze a well-known APDD-reduction technique called 
instantly decodable network coding (IDNC). 

IDNC has two variations, strict IDNC (S-IDNC) [4], [9], 
[15] and general IDNC (G-IDNC) [12], Both of them have 
been shown to provide lower APDD than RLNC with a small 
number of receivers, but become worse than RLNC with 
increasing number of receivers. Due to the absence of the 
optimal G-IDNC algorithm [12], we are not able to prove 
whether G-IDNC approximates or not. However, we 

are able to prove the following statement for S-IDNC: 

Lemma 3. S-IDNC does not provide a constant approxima¬ 
tion ratio for the minimum APDD problem. 

Proof: To prove this, it suffices to provide a counter 
example. Consider a complete graph Q (V. £) with I\ vertices 
and K(K — l)/2 edges. For every vertex ?;/, we generate a 
data packet p/ ;; . For every edge e l;) that connects v, and 
Vj we generate a receiver r n with W n = {p, ,p_,}. In the 
resultant A, every receiver wants two data packets. 

S-IDNC prohibits to code together any two data packets 
that are both wanted by any receiver. In other words, every 
S-IDNC coding set A4 must satisfy \A4 (~l W n \ ^ 1 for any 
receiver r n . Given the above A , this restriction implies that 
no data packets can be coded together at all. Hence, all K 
data packets must be broadcast uncoded alone. The resultant 
APDD is (K+ l)/2. 

Note that it is easy to show that the optimal value of APDD 
is at most 2. Indeed, the value of 2 can be achieved by 
using the RLNC technique. Thus, S-IDNC fails to provide 
a constant approximation ratio for the problem at hand. 

■ 

In conclusion, in this section we proved that RLNC is at 
least a 2-approximation algorithm of D m j n . By setting RLNC 
as a benchmark, we showed that S-IDNC fails to provide a 
constant approximation ratio for our problem. Indeed, RLNC 
is the only existing approximation algorithm, to the best of 
our knowledge. Therefore, the final question we are interested 
in is: How to overtake RLNC! 

V. How to Overtake RLNC 

Imagine a linear NC technique that: 1) is throughput 
optimal as RLNC (i.e., allows every receiver to increase its 
DoF by one in every transmission [6]); and 2) enables early 
packet decodings rather than block decodings in RLNC. Such 
an NC technique offers an APDD lower than RLNC, and 
thus will be an approximation algorithm of I) ]UUI with a ratio 
lower than RLNC. To the best of our knowledge, such NC 
techniques have not been developed in the literature. 

In this section, we propose a methodology for the develop¬ 
ment of such NC techniques. We first construct a hypergraph 
TT that corresponds to a given instance A. The key idea to 
guarantee optimal throughput and early packet decodings is 
to find minimal vertex covers in TT. A vertex cover is a subset 
Vc of V such that \Vc IT e n \ ^ 1 for every hyperedge. It is 
minimal if it is not the superset of a smaller vertex cover, 
implying that |Vc H e„| = 1 for at least one hyperedge. 
Hence, every receiver wants at least one data packet from 
Vc , and at least one receiver can instantly decode a wanted 
data packet from Vc- 

The core algorithmic structure of our methodology is 
sketched in Algorithm 1. It generates a solution S with coding 
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Algorithm 1 Structure of approximation algorithms of D mm 
1: Initialize: SFM A; 

2: Construct the hypergraph H(V,£) of A by mapping data 
packets to vertices and receivers to hyperedges; 

3: while Every hyperedge e n is non-empty do 
4: Find a minimal vertex cover Vc of 77, and send a 

random-coded packet of data packets in Vc', 

5: Update 77 by removing Vc from it following some 

strategy; 

6 : end while 

7: Send random-coded packets of all data packets until all 
receivers complete broadcast. 


sets {V^, • • • , Vc, V, V, ■ ■ ■ }, where L is the total number of 
minimal vertex covers found by the algorithm. To achieve op¬ 
timal throughput, the random-coded packet of every { V l c }/l 1 
must be able to increase every receiver’s DoF by one. To this 
end, a proper hypergraph update strategy must be applied. 
The simplest strategy is to completely remove V l c from 77 
before finding V\f l . By doing so, all the vertex covers will 
have empty intersections, and thus serve all the receivers with 
different data packets. The algorithm stops at the L- th round 
when there is at least one empty hyperedge, after which point, 
optimal throughput is maintained by sending random-coded 
packets of all data packets in V , as in RFNC. Hence, the 
solution S is throughput optimal as RFNC. Moreover, since 
the minimal vertex covers enable instant packet decodings, 
the APDD of S is better than RFNC. 

The design of optimal hypergraph vertex cover algorithms 
and hypergraph update strategies that minimize APDD is 
still an open problem. However, regardless of whether op¬ 
timal or heuristic algorithms/strategies are applied, solutions 
generated by Algorithm 1 are always throughput optimal, 
while also providing early packet decodings. Thus, they can 
approximate D m j n with ratio smaller than RFNC. 

Fig. 2 compares the APDD performance of a simple real¬ 
ization of our methodology with RFNC and a heuristic G- 
IDNC [20]. In this realization, we adopt the aforementioned 
complete Vc removal strategy and a heuristic hypergraph 
vertex cover algorithm, which iteratively adds to Vc the 
vertex that 1) is not connected to Vc; and 2) has the highest 
degree 1 . To generate the SFM instances, we consider a block 
of K = 15 data packets and assume that each receiver wants 
each data packet randomly with a probability of 0.2. The 
number of receivers N £ [5,100]. The results show that G- 
IDNC offers the lowest APDD when N is small, but becomes 
worse than RFNC when N > 65. Hence, the heuristic G- 
IDNC is not an approximation algorithm. Our realization 
(heur. VC in the figure) always outperforms RFNC. Their 
gap narrows down with increasing N. 

To gain a deeper insight into the realization and perfor¬ 
mance analysis of the proposed methodology, we conduct a 
case study in the next subsection by considering a special 
type of SFM. 

A Case Study 

In this subsection, we design a NC technique that ap¬ 
proximates the D m i n of a special type of SFM where every 
receiver wants two data packets from the packet block V. 
Though seemingly simple, this case is highly nontrivial, 

1 The degree of a vertex is the number of hyperedges incident to it 



Fig. 2. The APDD of heuristic VC. RLNC, and G-IDNC when K = 20. 



Fig. 3. An example of MIS partition. 

because its D mm cannot be approximated by existing APDD- 
reduction techniques such as S-IDNC, as shown in the proof 
of Theorem 3. 

Given such an SFM A, we first construct its hypergraph 
model by mapping data packets into vertices, and mapping 
receivers into hyperedges. Note that multiple receivers who 
want the same set of data packets are represented by one 
hyperedge. In addition, we weight every vertex Vk with a 
value of tk, which is the number of receivers who want p/.. 
We note that since every hyperedge has |e| =2, the resultant 
hypergraph is indeed a classic graph Q(V,£). 

We then partition V into two subsets: 

• The first subset is a minimal vertex cover Vc- Since 
every edge is incident to Vc, every receiver wants at 
least one data packet from Vc- Denote by IZc the 
receivers who want two data packets from Vc, and by 
Nc their number; 

• The second subset is a set V/ = V \ Vc- It is obvious 
that Vi is a maximal independent set, because it contains 
no edge (otherwise Vc is not a minimal vertex cover). 
Hence, every receiver wants at most one data packet 
from Vi. We denote by IZi the set of receivers who 
want one data packet from V/, and by TV/ their number. 
We have Nc + Ni = N. 

An example of such partition is demonstrated in Fig. 3. It has 
4 data packets and 5 receivers, with 4 of them want one data 
packet from both V/ and Vc- 

We then send the following two NC packets: 

• In the first transmission, send an RFNC packet of all 
data packets in Vc- This allows 7 Zi to decode one data 
packet, and allows 7 Zc to increase DoF by one without 
decoding; 

• In the second transmission, send an RFNC packet of all 
data packets in V. This allows IZi to decode the other 
data packet, and allows 7 Zc to decode two data packets. 

We call this technique maximal independent set (MIS) tech¬ 
nique. MIS follows Algorithm 1, and thus is an approxi- 




























Fig. 4. The APDD of the proposed technique, RLNC, and G-IDNC when 
K = 20 and w = 2. 


mation algorithm of D m i n . Its APDD, denoted by Dmis, is 
calculated as: 


Dmis 


AT/ • 1 + AT/ • 2 + 2AT C • 2 _ o AT/ 
27V ~~ 27V 


( 8 ) 


which is minimized when TV/ is maximized. Since TV/ = 
Y^vkGVi we ncet ^ to find the maximum weighted inde¬ 
pendent set V/, which is NP-hard [21]. Nevertheless, even a 
heuristically finding V/ can offer TV/ > 0. Hence, Dmis < 2 
regardless of the way V/ is found. 

We now derive the worst approximation rate of MIS by 
calculating an upper bound on Dmis- The minimum size of 
V/ is one, taking place when Q is complete. In this case, the 
optimal MIS will find the solo vertex with the largest weight. 
Hence, 7V//7V is minimized when all vertices have the same 
weight. In this case, we have Dmis is thus upper 

bounded as Dmis ^ 2 — Then, by noting that D = 1.5 
when every receiver wants 2 data packets, we conclude that: 


Theorem 3. MIS is at most a 4 2 J K -approximation algo¬ 
rithm of D m i n when every receiver wants 2 out of I\ packets. 


Fig. 4 compares the simulated APDD performance of MIS 
with RLNC and the heuristic G-IDNC [20]. The packet block 
size is I\ = 20. The number of receivers TV £ [5,100]. 
Every receiver randomly chooses two wanted data packets. 
Since K = 20, Dmis is upper bounded by 2 — = 1.95. 

The optimal Dmis is obtained by exhaustively searching the 
maximum weighted independent set. Both the performance 
of heuristic Dmis and heuristic G-IDNC are obtained by 
using the heuristic maximum weighted clique (a complete 
subgraph of a graph) search algorithm proposed in [20]. This 
algorithm can be adapted for MIS because an independent set 
of C? is a clique of the complementary graph Q. According 
to the results, both the optimal and heuristic Dmis are well 
below their upper bound, and are much better than both G- 
IDNC and RLNC. On the other hand, the APDD of G-IDNC 
exceeds RLNC when the number of receivers becomes large. 


VI. Conclusion 

In this paper, we proved that it is NP-hard to minimize 
the average packet decoding delay (APDD) in packet block 
based wireless broadcast using linear network coding. But the 
minimum APDD can be approximated by RLNC with a ratio 
of at most 2. In order to achieve a lower approximation rario, 
we proposed a methodology for the design of specialized 
approximation algorithms that always outperform RLNC. 


In the future, we are interested in designing more sophisti¬ 
cated realizations of the proposed NC framework. We are also 
interested in its extension to more general network settings, 
for example, when NC transmissions are subject to erasures. 
Besides, our hypergraph model and delay analysis may be 
extended to other network models such as cooperative data 
exchange and distributed data storage, because they also have 
similar types of demands on data packets. 
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